Perform EDA using Airbnb data

Get the Airbnb data from a Parquet file in the Databricks GitHub repository

Load the data and create a Spark DataFrame

Review the columns of the DataFrame

Preview the data to get an understanding of it

Check the data types in the DataFrame
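
The loading and inspection steps above can be sketched as follows. Reading Parquet with Spark returns a DataFrame, which is what exposes column names and a schema. The helper names `load_listings` and `inspect` and the app name are hypothetical, and the file path is left as a parameter because the exact location is not given here.

```python
def load_listings(path, spark=None):
    """Read a Parquet file into a Spark DataFrame."""
    # Imported lazily so the module can be loaded without a Spark installation.
    from pyspark.sql import SparkSession

    # Reuse a session if one is passed in (e.g. the notebook's `spark`),
    # otherwise create one. The app name is an arbitrary label.
    spark = spark or SparkSession.builder.appName("airbnb-eda").getOrCreate()
    return spark.read.parquet(path)


def inspect(df, n=5):
    """Print the column names, the first n rows, and the schema; return the dtypes."""
    print(df.columns)           # list of column names
    df.show(n, truncate=False)  # first n rows, without cutting long values
    df.printSchema()            # column types as a tree
    return df.dtypes            # list of (column name, type string) pairs
```

In a notebook this might be used as `df = load_listings(path)` followed by `inspect(df)`, where `path` points at the downloaded Parquet file.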

Get the list of numerical columns
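
One way to pick out the numerical columns is to filter the `(name, type)` pairs that a Spark DataFrame exposes as `df.dtypes`. This is a minimal sketch: the helper name `numeric_columns` is hypothetical, and the set of type strings treated as numeric is an assumption about what counts as "numerical" here.

```python
# Spark type strings treated as numeric in this sketch (an assumption).
NUMERIC_TYPES = {"tinyint", "smallint", "int", "bigint", "float", "double"}


def numeric_columns(dtypes):
    """Return the names of numeric columns from (name, type string) pairs."""
    return [
        name
        for name, dtype in dtypes
        # Decimal types carry precision and scale, e.g. "decimal(10,2)".
        if dtype in NUMERIC_TYPES or dtype.startswith("decimal")
    ]
```

With a Spark DataFrame `df`, the call would be `numeric_columns(df.dtypes)`.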

Create a map to visualize the locations of the Airbnb properties

Save the map to an HTML file
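
The mapping and saving steps could look like the sketch below. It assumes the `folium` library is used for the map, which is not confirmed here; the helper names, the marker styling, and the output file name `airbnb_map.html` are all hypothetical.

```python
def map_center(points):
    """Return the mean (latitude, longitude) of a list of (lat, lon) pairs."""
    lats = [lat for lat, _ in points]
    lons = [lon for _, lon in points]
    return sum(lats) / len(lats), sum(lons) / len(lons)


def build_map(points, out_html="airbnb_map.html", zoom_start=12):
    """Plot each (lat, lon) pair as a small circle and save the map as HTML."""
    import folium  # assumed mapping library, imported lazily

    # Center the map on the average position of the listings.
    m = folium.Map(location=list(map_center(points)), zoom_start=zoom_start)
    for lat, lon in points:
        folium.CircleMarker(location=[lat, lon], radius=2, fill=True).add_to(m)
    m.save(out_html)  # writes a standalone HTML file
    return m
```

The points would come from the DataFrame, for example by collecting its coordinate columns into a list of pairs. The column names (such as `latitude` and `longitude`) are assumptions. For a large dataset, sampling the rows before collecting keeps the HTML file small.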

Create a correlation matrix plot to find correlations

Price has a high correlation with accommodation, bathrooms and bedrooms
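
The correlation with price can also be checked numerically, not only from the plot. The sketch below uses Spark's `DataFrame.stat.corr`, which computes the Pearson correlation between two columns. The helper name `price_correlations` and the default target column `price` are assumptions, and columns containing nulls may need to be cleaned first.

```python
def price_correlations(df, numeric_cols, target="price"):
    """Return (column, correlation with target) pairs, strongest first."""
    pairs = [
        (col, df.stat.corr(target, col))  # Pearson correlation of two columns
        for col in numeric_cols
        if col != target                  # skip the target's self-correlation
    ]
    # Sort by absolute value so strong negative correlations rank high too.
    return sorted(pairs, key=lambda pair: abs(pair[1]), reverse=True)
```

Calling it with the list of numerical columns gives a ranked list that can be compared against what the correlation matrix plot shows.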

Price distribution plot

Price distribution grouped by neighborhood

Price distribution grouped by number of bedrooms, bathrooms, beds and number of reviews
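
Before plotting, the grouped price distributions can be summarized with a Spark aggregation. This is a minimal sketch: the helper names, the output column aliases, and the default column name `price` are assumptions, and the choice of summary statistics is illustrative, not the method used for the plots.

```python
def price_summary_exprs(price_col="price"):
    """Spark SQL expressions that summarize a price column."""
    return [
        f"count({price_col}) AS n",
        f"avg({price_col}) AS mean_price",
        f"percentile_approx({price_col}, 0.5) AS median_price",
        f"min({price_col}) AS min_price",
        f"max({price_col}) AS max_price",
    ]


def price_by_group(df, group_col, price_col="price"):
    """Summarize the price distribution within each value of group_col."""
    from pyspark.sql import functions as F  # imported lazily

    aggs = [F.expr(e) for e in price_summary_exprs(price_col)]
    return df.groupBy(group_col).agg(*aggs).orderBy(group_col)
```

The same helper can be called once per grouping column, for example the neighborhood column and then each of the bedroom, bathroom, bed and review-count columns. The result is small enough to convert with `toPandas()` for plotting.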